CDS

Accession Number TCMCG036C00255
gbkey CDS
Protein Id PTQ49969.1
Location complement(join(460812..461732,461862..462102,463425..463616,463878..463960,464122..464211,464446..464589,464806..464886,465070..465243,465465..465557,465759..465830,466174..466254,466391..466459,466611..466686,466876..466995,467109..467184,467392..468130))
GeneID Phytozome:Mapoly0001s0039
Organism Marchantia polymorpha
locus_tag MARPO_0001s0039
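
The Location field above can be checked against the stated protein length. The sketch below assumes the usual feature-table convention that each `start..end` span is inclusive and that `join(...)` concatenates the spans. The names `LOCATION` and `parse_segments` are hypothetical helpers introduced for illustration and are not part of the record.

```python
import re

# Location string copied verbatim from the record above.
LOCATION = (
    "complement(join(460812..461732,461862..462102,463425..463616,"
    "463878..463960,464122..464211,464446..464589,464806..464886,"
    "465070..465243,465465..465557,465759..465830,466174..466254,"
    "466391..466459,466611..466686,466876..466995,467109..467184,"
    "467392..468130))"
)


def parse_segments(location):
    """Return (start, end) integer pairs for every 'start..end' span."""
    return [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", location)]


segments = parse_segments(LOCATION)

# Coordinates are assumed inclusive, so each span covers end - start + 1 bases.
cds_length = sum(end - start + 1 for start, end in segments)

# One codon is the stop codon, so the protein is one residue shorter.
protein_length = cds_length // 3 - 1

print(len(segments), cds_length, protein_length)
```

Under these assumptions the 16 spans sum to 3252 nt, which is 1084 codons: 1083 residues plus a stop codon, in agreement with the Length field of the Protein section.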

Protein

Length 1083aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA53523, BioSample:SAMN00769973
db_source KZ772673.1
Definition hypothetical protein MARPO_0001s0039 [Marchantia polymorpha]
Locus_tag MARPO_0001s0039

EGGNOG-MAPPER Annotation

COG_category JK
Description CCAAT enhancer-binding protein
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000, ko03009
KEGG_ko ko:K14832
EC -
KEGG_Pathway -
GOs -

Sequence

CDS:  
ATGGGGAAAATTGCAGGGCAGGGCAAGGCGAGGAGCAAGCCTGTAGAAGAATTAGGAAAAATTGAAGCCATTCAGTCGGATGTTGCTTCGTTCGCCGCGAGCTTAGGATTAGCTGCAGGAGCAGGAGTATCTTCTGGATTTAATGATTCGGATTTCCGTAAAGTAGGATCGATCAGGAAGCCTGAGCAACCAAAGCAGGCAGTTGTAGAGAAGAAGAAGAAAGCGACGGAAACTGGTAAGAGCGCTGCTAGAAAAGACGACAAAGGAATCCGTTCTGGAAAAACTCGTCGTGTGGACGCGGACAGGAAGCAAGGGAGCAAGGAGAATGCTGGAGGCAACAACAGAGGGAAAAGATTGGCGAATGGCCTTGAAGGCAAAGATGCGAATGGTCCCGGTGACAAAAGTCCCGGTGGGTGGAAGGCGAAGGAAAATCACGCGAATCCTAAGAAGCGTAAAGGTAGCTGGGAAGTTGGAGCTGATGGTGAAAGCGAAGTAAAGCGGCCTCTGGTTGAGACACCGTATGGTGAGCCCGAATTTGTGAAGAAGTTGCGATCAGAGGCTCAGGACTTACTGGAGAAGGCCGCTAATGATTTTGAAAAAAACCGATCCAGAGATAAAGATGCGGAGTGGCTCTTGAAAGCCCGGCGATCTGGAACTTCCGCCGATAAAGTGGCTGCCATGACAGTTATCTTGCAAGAAAATCCGAAAGCCAACGTGCGTACATTGGACGCCCTATTAGGGATGATGACCTCCAAAGCAGGAAAACGTCACGCAGCAACTGGAATCGACGCACTGAAGGAATTATTTTTGTACAATCTTCTTCCTGACCGCAAACTGAAGTACTTTGCTCAACAGCCTCTTGCAACCCTTCCTGAGGGCAAAGAGCGTTCTTCATTGCTGCTGGATTGGATTTGGGAAGACTGCTTAAAGCACAGGTTTGAGCGATTTGTGATTTCTCTTGACGAGGCTTCGAAAGACAAATTACCTTTCTTGAAGGAGAAAGCCCTCAAGTCAGTTTACGAACTTCTGAAGACAAAGCCAGAACAAGAAAGGCGGCTTTTGTCCACTCTAGTGAACAAGCTAGGAGATCCGGAGCGTAAAGTGGCATCGAATGCTTGCTACCTTCTATCCAGTCTACTGACAGCTCATCCGAACATGAAGAAAATTGTTGTTGATGAAGTCGACGGGTTTGTGTTCCGACCCCACGTCGGTCTCAGAGCGAGGTATTACGCGACAGTCTTTTTGAACCAAATAGTTCTCAGTGTCAAAGGTGATGGTCCTAAGCTGGCAAAGCAGCTCATTGATCTCTATTTTGCCCTTTTCAAGGCTGTGACAACTGGTGAGCAAGATCTCAATAAGGAGGAGAACGGAAAGAAGGGAACGAAGGGGGAGAAGGAGAGAGTTCGGAGAGGCAAGCGAAAGGAAAGAAATAACGACAAGTCACTAGCTGCTGACTTTACAACGGAGATTGATTCAAGGCTGCTGTCTGCCCTTCTGACTGGTGTCAATCGAGCATTTCCATATATTTCTGCAGAAGATCTGGATACAGTCACACAAGAAAATTCTGTTCTTTTCAGATTGGTACATTCGACCAACTTCAATGTAGCAGTGCAAGCATTGATGCTTCTCCATCAGCTGATGGTCAAGAACCAGGCCGTCAATGACCGTTTCTATCGAGCTTTGTATTCCGTCCTTCTGTCTGAAAGCCTTGCGAAGTCATCGAAGGCCGAAATGTTTCTCGGGCTTATCTTTAAGGCTCTAAAATCTGATGTCGATGTTAGGCGCATGTCAGCTATAGCGAAGCGCCTGACTCAGGTGGCAATTCAGCAGACACCGGAATTTGCCTGTGCTTCACTCTTCCTCATCTCAGAGATTCTCAAACTGAAACCCACTCTTTGGAATTCGGTGTTAAATGCCGAAGATCATGACGACGACCGTGAGCACTTTGAGGATCAGGGAGAAGAAAGTGATGACAATGAAAGAGGCAATACTGAGAAAGC
CAATGGCCATAAAGAAGACAGTCAAGACTCAGGAGATACGTGGCCAGAAAAAGGATACTATGATCCTAAAGCTCGTGATCCACTTTACTGCCAAGCACATAGAGCATGCTGGTGGGAGCTCACCGTTCTTGCAAAGCACGTACATCCGTCAGTGGCAGCAATGGCTCGTACTCTTCTTTCAGGTGGCAACATCATTTATAGTGGAGACCCCTTGCGGGATCTTGCCTTGGGAGTTTTCCTTGACCGGTTTGTTGAGAAGAAACCTAAAGCCACCAAAAGGAAGACAGAAGGAACATGGCACGGCTCGTCTATGGCTTTGCCCGCGAAACTGGACGGAGCGAAATCTGCGGGACCAGTAGGTGAGGACATTCTGAAGTTGGCCGAAGAAGATGTAGCGCCTGAGGATGTCGTCTTCCACAAGTTTTATTCGACGAAGTCAATAAGAAGCAAACCTCAGAAGAAGAAGAAGAGCAAGGTAAATTCTGAGGAAGACGTCCTAGGATTGGACGAGGTTGCAGCTTTTGACGGAGACGACAGTGAAAACGAAGAAATCGACGAGTTACTTGAGCAGGAAGCAGGTGAGGAGATGATGGCAGATGATAGTGAAGGTGAAGAAATGGAGAGTGACGACGAAGAGATGTTGTACTACAAGAAAGATGAGGATGATGACCACGATTTAGAATCAGATGTAGAAGAAGGAGATAGTGAAGGGGATGAAGCAAATGCTCTCCTAGAAGACGGACTAGAGATGAGTAGTAGTGATGGGGAGGAAATTGAAAGCGAAGAAGACGAGATGGGCAGTGAAGAAGAAGATTTCCTGCCTAAGAAGGGTAAGAAACAGAGTGCTGGTAGGCAATCATCTAAAAAACGTGCCTCAGTTTTTCAAGAAGCAGAGGATTCGGACGGTGAAGTGGTTGGTGTCGAAGATGAGGATATGCTGCCTAGGAAGCAAAGACTTCAAACTCCTGCTCGGGGGCCGGCTAAGAAACGTGCATCCCCTTTTCAGGAACAGGAGGATTCTGACTTTGATGTAGCTAGTGAAGCAGATTCCGCACTCAGAAAAAGAAACAAGAAAATAGTCAAGAGACGTCTTTCTGTTTTTCAGGAGGCTGGATCCGACAGTGATGATGCCGGACTAATAAAACTTTCGGGTAGAAATCAACGTAGTATCAACAGAATAGGGAAGAATTCAGGAGCCCAAGCAGTAAGTGCTTCACGTAGCAAGAAAATTCGCAGCAATCGCAAAATGTGA
Protein:  
MGKIAGQGKARSKPVEELGKIEAIQSDVASFAASLGLAAGAGVSSGFNDSDFRKVGSIRKPEQPKQAVVEKKKKATETGKSAARKDDKGIRSGKTRRVDADRKQGSKENAGGNNRGKRLANGLEGKDANGPGDKSPGGWKAKENHANPKKRKGSWEVGADGESEVKRPLVETPYGEPEFVKKLRSEAQDLLEKAANDFEKNRSRDKDAEWLLKARRSGTSADKVAAMTVILQENPKANVRTLDALLGMMTSKAGKRHAATGIDALKELFLYNLLPDRKLKYFAQQPLATLPEGKERSSLLLDWIWEDCLKHRFERFVISLDEASKDKLPFLKEKALKSVYELLKTKPEQERRLLSTLVNKLGDPERKVASNACYLLSSLLTAHPNMKKIVVDEVDGFVFRPHVGLRARYYATVFLNQIVLSVKGDGPKLAKQLIDLYFALFKAVTTGEQDLNKEENGKKGTKGEKERVRRGKRKERNNDKSLAADFTTEIDSRLLSALLTGVNRAFPYISAEDLDTVTQENSVLFRLVHSTNFNVAVQALMLLHQLMVKNQAVNDRFYRALYSVLLSESLAKSSKAEMFLGLIFKALKSDVDVRRMSAIAKRLTQVAIQQTPEFACASLFLISEILKLKPTLWNSVLNAEDHDDDREHFEDQGEESDDNERGNTEKANGHKEDSQDSGDTWPEKGYYDPKARDPLYCQAHRACWWELTVLAKHVHPSVAAMARTLLSGGNIIYSGDPLRDLALGVFLDRFVEKKPKATKRKTEGTWHGSSMALPAKLDGAKSAGPVGEDILKLAEEDVAPEDVVFHKFYSTKSIRSKPQKKKKSKVNSEEDVLGLDEVAAFDGDDSENEEIDELLEQEAGEEMMADDSEGEEMESDDEEMLYYKKDEDDDHDLESDVEEGDSEGDEANALLEDGLEMSSSDGEEIESEEDEMGSEEEDFLPKKGKKQSAGRQSSKKRASVFQEAEDSDGEVVGVEDEDMLPRKQRLQTPARGPAKKRASPFQEQEDSDFDVASEADSALRKRNKKIVKRRLSVFQEAGSDSDDAGLIKLSGRNQRSINRIGKNSGAQAVSASRSKKIRSNRKM
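
The relationship between the CDS and Protein sequences above can be spot-checked with a short translation routine. This is a minimal sketch that assumes the standard genetic code (NCBI translation table 1); the record itself does not state which table was used. The function name `translate` and the two test fragments (the first 30 and last 15 nucleotides of the CDS) are chosen here for illustration only.

```python
# Standard genetic code (NCBI table 1), listed in TCAG order for each position.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)


def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 30 nt and last 15 nt of the CDS shown above.
print(translate("ATGGGGAAAATTGCAGGGCAGGGCAAGGCG"))  # MGKIAGQGKA
print(translate("AATCGCAAAATGTGA"))                 # NRKM
```

Both fragments reproduce the corresponding ends of the Protein sequence (`MGKIAGQGKA...` and `...NRKM`), the latter terminating at the TGA stop codon.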